Abstract

Annual levels of US landfalling hurricane activity averaged over the last 11 years (1995-2005) are
higher than those averaged over the previous 95 years (1900-1994). How, then, should we best predict
hurricane activity rates for next year? Based on the assumption that the higher rates will continue, we
use an optimal combination of averages over the long and short time-periods to produce a prediction
that minimises mean squared error (MSE).



1 Introduction 

There is considerable commercial interest in the prediction of hurricane activity, although different
industries are interested in predictions over different time-scales. The insurance industry, for instance, is
mostly interested in the year-ahead timescale, since year-ahead forecasts allow time for insurance rates
to be adjusted appropriately. Motivated by this, the purpose of this article is to investigate some of the
properties of such year-ahead predictions. A complete year-ahead prediction would consider all aspects 
of hurricane activity such as hurricane intensity, size, timing, location, and so on. In this study, however, 
we will focus on one aspect only: the prediction of the annual number of hurricanes. 

One might divide methods for the year-ahead prediction of hurricane numbers into model-free and
model-based methods. Model-free methods do not make any assumptions about what has driven variations in
historical hurricane numbers. For example, the study described in Khare and Jewson (2005) only makes
the assumption that methods for hurricane prediction that have worked well in the past will work well in 
the future. The problem of finding good prediction methods then reduces to finding methods that would 
have worked well in the past. 

Model-based methods, on the other hand, make much more specific assumptions, in the form of a model
for what has happened in the past and what will happen in the future. The problem of finding good 
prediction methods then reduces to (a) choosing a good model and (b) understanding how best to make 
predictions, given the model. 

It is not, in general, possible to say which of model-free or model-based methods is better. The model-free 
method cited above is elegant because of the minimal assumptions, but may not work well if the future 
is very unlike the past. Model-based methods can always be criticized for the particular choice of model, 
which is always arbitrary, and indeed always wrong (at some level), but they give more flexibility in terms
of incorporating non-stationarities, and other assumptions. They are ideal for investigating how different 
assumptions lead to different conclusions. 

In this particular study, we investigate the possibility of using a model-based method for predicting the 
number of landfalling hurricanes for next year. We use one of the simplest possible non-trivial models. 
Our purpose is not actually to make accurate hurricane predictions per se (since one would probably want
to use a more complex model for that) but to illustrate the interaction between models and predictions
based on those models, and, in particular, to highlight the trade-off between bias and error variance in 
this problem. 



2 Unbiased and biased categorical forecasts 

When making a forecast, it makes sense to state precisely what that forecast is trying to achieve, in terms 
of the error statistics of the forecast. For single-value forecasts typical goals might be that the forecast
errors should be zero mean (i.e. the forecast is unbiased), or that the forecast errors should have low 
variance, or both. For probabilistic forecasts the goal might be that the forecast should maximise the 
probability of the observations given the forecast. 

In the single-valued forecast case there is often a trade-off between the twin goals of low bias and low 
variance, and it may not be possible to reduce one without increasing the other. The most obvious way 
to make this trade-off is then to focus on the single measure of MSE, which is a combination of the mean 
and the variance. One case in which such a trade-off occurs in meteorological forecasting is when we have 
historical values for a meteorological variable over a number of years, and each year falls into one of two 
categories, which we will call A and B. We assume that next year is going to be an example of category 
A. How, then, should we predict the variable? In the case in which we have many years of historical 
data for category A, or where the signal to noise ratio is large, the best single-value forecast is obvious: 
it consists of the mean of the historical data in category A. However, in the case where we only have a 
small number of historical years in category A, a larger number in category B, and the signal to noise 
ratio is small, how to make the best forecast is less obvious. The reason for this is that in this case the
mean of categories A and B combined may actually provide a forecast that is nearly as good as, or even
better (in terms of MSE) than, the mean of category A. This is because category B contains more data
than category A, leading to a more precise prediction, and this benefit may be more important than the 
harm caused by the data in category B being less representative than that in category A. Mathematically, 
the prediction based on category A has a zero bias but high variance (because of the small amount of 
data), while the prediction based on categories A and B together has a non-zero bias, but low variance 
(because there is more data). Either prediction could win in terms of MSE, depending on the details (the
number of years of data in each category, and the strength of the signal). However, a suitably weighted
combination of the two predictions is always at least as good as either.

The way we formulate the hurricane prediction question we have outlined in the introduction turns it 
into exactly this kind of problem. The most naive forecast for hurricane activity for next year is probably 
the average over the longest baseline for which we have reliable data. We will use hurricane count data 
from 1900, and so in our case that baseline is 1900-2005 (106 years of data). The argument for using 
such a prediction is that it is based on a lot of data. It is clear, however, that the time series of hurricane 
counts are not stationary during this period (this issue has been discussed at length in the scientific
literature: see, for example, Goldenberg et al. (2001) or Gray et al. (1997)). In particular, the last 11
years (1995-2005) have shown more hurricanes than the long-term average, and the two available physical 
explanations for this (the Atlantic Multi-Decadal Oscillation and climate change) both suggest that this 
increased level will continue. This motivates the idea that a better forecast might be the average over this 
short baseline. The argument for this alternative prediction is that it should better capture the phase of 
whatever cycle or trend has caused the recent increase in numbers, while the disadvantage is that it is 
based on less data. 

We now put this problem into the categorical forecasting framework outlined above. Category A is the 
data for 1995-2005 (11 years), and we believe that 2006 is also going to be in category A. Category B is 
the data for 1900-1994 (95 years). We would expect a prediction based on the average of the hurricane 
activity in category A to be unbiased, but to have a high error variance because of the lack of data and 
the high level of noise. We would expect a prediction based on the average of the hurricane activity in 
categories A and B together to be biased, but to have a lower error variance because more data is being 
used. It is unclear a priori which prediction will have the lower MSE. 

The goal of this paper is to evaluate these two prediction methods and to derive the relations that give
optimal combinations of them. To keep the problem as simple as possible, in order to illustrate the
concepts involved, we will make the following straightforward assumptions:



• We assume that annual hurricane numbers for 1900-2005 can be modelled as samples from Poisson
distributions.

• We assume that the mean of these Poisson distributions was constant at one level from 1900 to
1994, and constant at another level from 1995 to 2005.
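
As a minimal sketch of what these two assumptions mean in practice, the following Python fragment simulates one synthetic 106-year record under the two-level Poisson model. The function names, the seed and the rates 1.64 and 2.18 (rounded from the sample means quoted in Section 5) are illustrative assumptions, and the sampler is Knuth's standard multiplication method rather than anything specific to this study.

```python
import math
import random


def poisson_sample(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method.

    Adequate for the small rates considered here (lam of order 1 to 3).
    """
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_counts(lam_early, lam_recent, n_early=95, n_recent=11, seed=0):
    """Simulate annual counts: n_early years at one rate, then n_recent at another."""
    rng = random.Random(seed)
    early = [poisson_sample(lam_early, rng) for _ in range(n_early)]
    recent = [poisson_sample(lam_recent, rng) for _ in range(n_recent)]
    return early, recent


# Illustrative rates only (assumed, rounded from the Section 5 sample means).
early, recent = simulate_counts(1.64, 2.18)
print(sum(early) / len(early), sum(recent) / len(recent))
```

Re-running with different seeds gives a feel for how noisy an 11-year sample mean is compared with a 95-year one.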

We could, as is always the case in such a modelling study, make our model more complex. For instance, 
it is not correct that mean hurricane rates were constant from 1900 to 1994: there are clear multidecadal 
shifts in activity during this period, as mentioned earlier. And the Poisson distribution is not a perfect
fit either, with the data having a slightly larger variance than the mean. However, we believe that this 
simple model is a useful addition to the debate about how to predict future hurricane numbers because 
it is simple, analytically tractable, and easy to understand. It also introduces and allows us to quantify 
the trade-off between mean error and error variance that lies at the root of this problem. 

We note that we have previously solved the same statistical problem for the case where the data is
normally distributed rather than Poisson (Jewson and Penzer, 2004). The context for that study was the
prediction of the impacts of El Niño on US temperatures.



3 Individual vs. Overall Sample Mean 



In this section we derive the properties of the two most straightforward ways of making a categorical 
forecast from two-category data: taking the sample mean of both the categories together, and taking
the sample mean within each category. In the next section we will investigate mixing these two simple 
predictions in an optimal way. 

Consider random samples from two populations 

$$ Y_{1j} \sim \mathrm{Pois}(\lambda_1), \quad j = 1, \ldots, n_1 \qquad (1) $$
$$ Y_{2j} \sim \mathrm{Pois}(\lambda_2), \quad j = 1, \ldots, n_2 \qquad (2) $$



Our interest lies in predicting the value ($Y_{1,n_1+1}$) and the expected value ($E(Y_{1,n_1+1})$) of a new observation
from (without loss of generality) the first population. We consider the sample mean of population 1 and
the overall sample mean for populations 1 and 2 as predictors. First we define the sample mean as:



$$ \hat{\lambda}_i = \frac{1}{n_i} \sum_{j=1}^{n_i} Y_{ij}, \quad i = 1, 2 \qquad (3) $$

Our two predictors are then 

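$$ \hat{Y}_{1,n_1+1} = \hat{\lambda}_1 \qquad (4) $$

$$ \hat{Y}^{\dagger}_{1,n_1+1} = \frac{n_1}{n_1+n_2}\hat{\lambda}_1 + \frac{n_2}{n_1+n_2}\hat{\lambda}_2 \qquad (5) $$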



The properties of these predictors are given below: 



$$ E(\hat{Y}_{1,n_1+1} - Y_{1,n_1+1}) = 0 \quad \text{(unbiased)} \qquad (6) $$

$$ E(\hat{Y}_{1,n_1+1} - E(Y_{1,n_1+1})) = 0 \quad \text{(unbiased)} \qquad (7) $$

$$ \mathrm{Var}(\hat{Y}_{1,n_1+1} - Y_{1,n_1+1}) = \left(1 + \frac{1}{n_1}\right)\lambda_1 \qquad (8) $$

$$ \mathrm{Var}(\hat{Y}_{1,n_1+1} - E(Y_{1,n_1+1})) = \frac{1}{n_1}\lambda_1 \qquad (9) $$

Defining $\mathrm{MSE}_1$ as the mean squared error of predictions of $Y_{1,n_1+1}$, and $\mathrm{MSE}_2$ as the mean squared error
of predictions of $E(Y_{1,n_1+1})$, we then have:



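$$ \mathrm{MSE}_1(\hat{Y}_{1,n_1+1}) = \left(1 + \frac{1}{n_1}\right)\lambda_1 \qquad (10) $$

$$ \mathrm{MSE}_2(\hat{Y}_{1,n_1+1}) = \frac{1}{n_1}\lambda_1 \qquad (11) $$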



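$$ E(\hat{Y}^{\dagger}_{1,n_1+1} - Y_{1,n_1+1}) = \frac{n_2}{n_1+n_2}(\lambda_2 - \lambda_1) \quad \text{(biased)} \qquad (12) $$

$$ E(\hat{Y}^{\dagger}_{1,n_1+1} - E(Y_{1,n_1+1})) = \frac{n_2}{n_1+n_2}(\lambda_2 - \lambda_1) \quad \text{(biased)} \qquad (13) $$

$$ \mathrm{Var}(\hat{Y}^{\dagger}_{1,n_1+1} - Y_{1,n_1+1}) = \left(1 + \frac{n_1}{(n_1+n_2)^2}\right)\lambda_1 + \frac{n_2}{(n_1+n_2)^2}\lambda_2 \qquad (14) $$

$$ \mathrm{Var}(\hat{Y}^{\dagger}_{1,n_1+1} - E(Y_{1,n_1+1})) = \frac{n_1}{(n_1+n_2)^2}\lambda_1 + \frac{n_2}{(n_1+n_2)^2}\lambda_2 \qquad (15) $$

$$ \mathrm{MSE}_1(\hat{Y}^{\dagger}_{1,n_1+1}) = \left(1 + \frac{n_1}{(n_1+n_2)^2}\right)\lambda_1 + \frac{n_2}{(n_1+n_2)^2}\lambda_2 + \left(\frac{n_2}{n_1+n_2}\right)^2(\lambda_2 - \lambda_1)^2 \qquad (16) $$

$$ \mathrm{MSE}_2(\hat{Y}^{\dagger}_{1,n_1+1}) = \frac{n_1}{(n_1+n_2)^2}\lambda_1 + \frac{n_2}{(n_1+n_2)^2}\lambda_2 + \left(\frac{n_2}{n_1+n_2}\right)^2(\lambda_2 - \lambda_1)^2 \qquad (17) $$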
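
The error statistics of the two simple predictors can be evaluated directly from the expressions above. The sketch below does so in Python; the function names and the input values (rates of 2.0 and 1.5, with 10 and 90 years of data) are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def individual_mean_props(lam1, n1):
    """Error statistics of the category-1 sample mean, following eqs (6)-(11)."""
    var2 = lam1 / n1      # error variance when the target is E(Y)
    var1 = lam1 + var2    # the new observation adds its own Poisson variance lam1
    return {"bias": 0.0, "var1": var1, "var2": var2, "mse1": var1, "mse2": var2}


def overall_mean_props(lam1, lam2, n1, n2):
    """Error statistics of the pooled sample mean, following eqs (12)-(17)."""
    n = n1 + n2
    bias = n2 / n * (lam2 - lam1)
    var2 = (n1 * lam1 + n2 * lam2) / n**2
    var1 = lam1 + var2
    return {"bias": bias, "var1": var1, "var2": var2,
            "mse1": var1 + bias**2, "mse2": var2 + bias**2}


# Hypothetical rates and sample sizes, chosen only for illustration.
short = individual_mean_props(lam1=2.0, n1=10)
pooled = overall_mean_props(lam1=2.0, lam2=1.5, n1=10, n2=90)
print(short["mse2"], pooled["mse2"])
```

With these inputs the pooled mean has a much smaller variance (0.0155 against 0.2) but its squared bias (0.2025) is large enough that its MSE is slightly worse, which is the trade-off discussed in Section 2.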

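Which of these two predictions is better? The condition under which the overall sample mean is preferable
to the individual sample mean (in terms of $\mathrm{MSE}_1$) as a predictor of $Y_{1,n_1+1}$ can now be derived, and is:

$$ \mathrm{MSE}_1(\hat{Y}^{\dagger}_{1,n_1+1}) < \mathrm{MSE}_1(\hat{Y}_{1,n_1+1}) \iff (\lambda_2 - \lambda_1)^2 < \frac{\lambda_1}{n_1} + \frac{2\lambda_1 - \lambda_2}{n_2} \qquad (18) $$

The same relation holds for $\mathrm{MSE}_2$, since $\mathrm{MSE}_1$ and $\mathrm{MSE}_2$ differ only by the constant $\lambda_1$.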
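
The condition can be checked numerically against a direct comparison of the two mean squared errors. The following Python sketch does this for two hypothetical cases (a gap of 0.5 and a gap of 0.2 between the rates); all names and input values are assumptions made for illustration.

```python
def breakeven_gap_squared(lam1, lam2, n1, n2):
    """Right-hand side of condition (18)."""
    return lam1 / n1 + (2.0 * lam1 - lam2) / n2


def overall_mean_preferred(lam1, lam2, n1, n2):
    """True when the pooled mean has lower MSE than the category-1 mean."""
    return (lam2 - lam1) ** 2 < breakeven_gap_squared(lam1, lam2, n1, n2)


def mse2_difference(lam1, lam2, n1, n2):
    """MSE2(pooled) minus MSE2(category 1), computed directly from eqs (11) and (17)."""
    n = n1 + n2
    pooled = (n1 * lam1 + n2 * lam2) / n**2 + (n2 / n * (lam2 - lam1)) ** 2
    return pooled - lam1 / n1


# Hypothetical cases: a larger and a smaller gap between the two rates.
for lam2 in (1.5, 1.8):
    print(lam2,
          overall_mean_preferred(2.0, lam2, 10, 90),
          mse2_difference(2.0, lam2, 10, 90) < 0)
```

In both cases the two tests agree: the pooled mean loses when the gap is 0.5 and wins when the gap is 0.2.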



4 General Mixed Predictors 



We now consider mixing the two forecasts discussed above to create an optimal categorical prediction. 
We consider the general mixed predictor 



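$$ \hat{Y}^{*}_{1,n_1+1}(\alpha) = \alpha\hat{\lambda}_1 + (1-\alpha)\hat{\lambda}_2 \qquad (19) $$

Note that

$$ \hat{Y}_{1,n_1+1} = \hat{Y}^{*}_{1,n_1+1}(1), \qquad (20) $$

$$ \hat{Y}^{\dagger}_{1,n_1+1} = \hat{Y}^{*}_{1,n_1+1}(n_1/(n_1+n_2)) \qquad (21) $$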



The properties of the general mixed predictor are 



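$$ E(\hat{Y}^{*}_{1,n_1+1}(\alpha) - Y_{1,n_1+1}) = (1-\alpha)(\lambda_2 - \lambda_1) \qquad (22) $$

$$ E(\hat{Y}^{*}_{1,n_1+1}(\alpha) - E(Y_{1,n_1+1})) = (1-\alpha)(\lambda_2 - \lambda_1) \qquad (23) $$

$$ \mathrm{Var}(\hat{Y}^{*}_{1,n_1+1}(\alpha) - Y_{1,n_1+1}) = \left(1 + \frac{\alpha^2}{n_1}\right)\lambda_1 + \frac{(1-\alpha)^2}{n_2}\lambda_2 \qquad (24) $$

$$ \mathrm{Var}(\hat{Y}^{*}_{1,n_1+1}(\alpha) - E(Y_{1,n_1+1})) = \frac{\alpha^2}{n_1}\lambda_1 + \frac{(1-\alpha)^2}{n_2}\lambda_2 \qquad (25) $$

$$ \mathrm{MSE}_1(\hat{Y}^{*}_{1,n_1+1}(\alpha)) = \left(1 + \frac{\alpha^2}{n_1}\right)\lambda_1 + \frac{(1-\alpha)^2}{n_2}\lambda_2 + (1-\alpha)^2(\lambda_2 - \lambda_1)^2 \qquad (26) $$

$$ \mathrm{MSE}_2(\hat{Y}^{*}_{1,n_1+1}(\alpha)) = \frac{\alpha^2}{n_1}\lambda_1 + \frac{(1-\alpha)^2}{n_2}\lambda_2 + (1-\alpha)^2(\lambda_2 - \lambda_1)^2 \qquad (27) $$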
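
A short Python sketch of the mixed-predictor MSE makes it easy to confirm that the two simple forecasts are special cases. The function name, the `target` argument and the input values are hypothetical choices made for this illustration.

```python
def mixed_mse(alpha, lam1, lam2, n1, n2, target="mean"):
    """MSE of alpha*mean1 + (1 - alpha)*mean2, following eqs (26)-(27).

    target="mean" scores against E(Y) (MSE2); target="count" scores against
    the new observation itself (MSE1), which adds the Poisson variance lam1.
    """
    variance = alpha**2 * lam1 / n1 + (1 - alpha) ** 2 * lam2 / n2
    bias = (1 - alpha) * (lam2 - lam1)
    mse2 = variance + bias**2
    return mse2 if target == "mean" else mse2 + lam1


# Hypothetical inputs: alpha = 1 gives the short-baseline forecast,
# alpha = n1 / (n1 + n2) gives the pooled forecast.
lam1, lam2, n1, n2 = 2.0, 1.5, 10, 90
for alpha in (1.0, n1 / (n1 + n2), 0.5):
    print(alpha, mixed_mse(alpha, lam1, lam2, n1, n2))
```

For these inputs an intermediate weight such as 0.5 already gives a lower MSE than either special case, which motivates searching for the best weight.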

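In order to find the "best" mixed predictor we minimize the MSE with respect to $\alpha$:

$$ \frac{\partial}{\partial \alpha}\,\mathrm{MSE}(\hat{Y}^{*}_{1,n_1+1}(\alpha)) = 0 \qquad (28) $$

which gives

$$ \alpha = \frac{n_1 n_2 (\lambda_2 - \lambda_1)^2 + n_1 \lambda_2}{n_1 n_2 (\lambda_2 - \lambda_1)^2 + n_2 \lambda_1 + n_1 \lambda_2} \qquad (29) $$

Note that this is of the form

$$ \alpha = \frac{p}{p + q} \qquad (30) $$

where $p = n_1 n_2 (\lambda_2 - \lambda_1)^2 + n_1 \lambda_2$ and $q = n_2 \lambda_1$.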
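
As a sanity check on the closed form, the sketch below compares it with a brute-force grid search over the weight. The helper names, the grid resolution and the input values are hypothetical; the grid search is only a check and is not part of the derivation.

```python
def optimal_alpha(lam1, lam2, n1, n2):
    """Closed-form minimiser of the mixed-predictor MSE, in the p / (p + q) form."""
    p = n1 * n2 * (lam2 - lam1) ** 2 + n1 * lam2
    q = n2 * lam1
    return p / (p + q)


def mse2(alpha, lam1, lam2, n1, n2):
    """MSE of the mixed predictor when the target is the expected value."""
    return (alpha**2 * lam1 / n1
            + (1 - alpha) ** 2 * lam2 / n2
            + (1 - alpha) ** 2 * (lam2 - lam1) ** 2)


def grid_search_alpha(lam1, lam2, n1, n2, steps=1000):
    """Brute-force minimiser over an evenly spaced grid on [0, 1]."""
    candidates = [i / steps for i in range(steps + 1)]
    return min(candidates, key=lambda a: mse2(a, lam1, lam2, n1, n2))


# Hypothetical inputs, chosen only for illustration.
closed = optimal_alpha(2.0, 1.5, 10, 90)
gridded = grid_search_alpha(2.0, 1.5, 10, 90)
print(closed, gridded)
```

Because $p \ge 0$ and $q > 0$ for positive rates, the weight always lies between 0 and 1, and it approaches 1 as the squared difference between the two rates grows relative to the sampling noise.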



5 Example 



We now apply the equations derived above to the real case of landfalling hurricanes in the US. The mean
number of landfalling hurricanes from 1900 to 1994 was 1.642105 [1], while the mean number of landfalling
hurricanes from 1995 to 2005 was 2.181818 [2].

Equation (29) gives a value of $\alpha = 0.609$. The values of bias, error standard deviation and RMSE for
predictions made from the three forecasts discussed in the preceding sections are given in Table 1. We are
principally interested in predicting the expected number of hurricanes (rather than just the number of
hurricanes), and so we are most interested in SD2 and RMSE2.

What we see in this example is: 



• Bias: model 1 (the forecast consisting of the average of the last 11 years) has the lowest bias (of 
zero), model 2 (the forecast consisting of the average of the last 106 years) has the most bias, and 
the level of bias from model 3 (the optimum mixture) is in between the two. 

• Error SD2: model 1 has the highest error SD, model 2 the lowest, and the level of error SD from 
model 3 is in between the two. 

• RMSE2: model 2 has a slightly higher RMSE than model 1, while the RMSE for model 3 is the
lowest.

The behaviour of the bias and error variances is exactly as expected. It is interesting that the short
baseline forecast (model 1) beats the long baseline forecast (model 2) in terms of RMSE, but only just.
The optimal forecast has given us a forecast with lower RMSE than either of the components, as it should.

model   prediction   mean error   SD1     SD2     RMSE1   RMSE2
1       2.182        0.000        1.543   0.445   1.543   0.445
2       1.698        0.484        1.482   0.127   1.559   0.500
3       1.971        0.211        1.503   0.276   1.517   0.347

Table 1: Predictions, and properties of those predictions, for three prediction models: model 1 ($\hat{Y}$) is the
'obvious', and unbiased, model based on a short recent baseline, model 2 ($\hat{Y}^{\dagger}$) is the 'wrong', and biased,
model based on a long baseline, and model 3 ($\hat{Y}^{*}$) is the optimal combination of the two. SD1 and RMSE1
refer to predictions of the number of hurricanes, SD2 and RMSE2 to predictions of the expected number.

[1] There are various slightly different versions of the data for the number of US landfalling hurricanes because of (a)
different corrections to obvious errors in the data and (b) different definitions of 'landfalling'. Our version is based as
closely as possible on the SSS definition from the HURDAT database (Jarvinen et al., 1984).

[2] The hurricane season for 2005 is not quite finished yet: we are assuming that hurricane Alpha will be the last, giving
a total of 4 landfalls for 2005.
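
The entries of Table 1 can be recomputed from the two sample means quoted in this section. The Python sketch below is one way to do so, under the assumption (implicit in the example) that the sample means are used in place of the unknown Poisson rates; the variable and function names are hypothetical. The recomputed values agree with the table to within about 0.001.

```python
import math

N1, N2 = 11, 95
# Sample means quoted in this section, used here in place of the unknown rates.
LAM1, LAM2 = 2.181818, 1.642105   # 1995-2005 and 1900-1994


def table_row(alpha):
    """Prediction, |bias|, SD1, SD2, RMSE1, RMSE2 for a given mixing weight."""
    prediction = alpha * LAM1 + (1 - alpha) * LAM2
    bias = abs((1 - alpha) * (LAM2 - LAM1))
    var2 = alpha**2 * LAM1 / N1 + (1 - alpha) ** 2 * LAM2 / N2
    var1 = LAM1 + var2
    return (prediction, bias, math.sqrt(var1), math.sqrt(var2),
            math.sqrt(var1 + bias**2), math.sqrt(var2 + bias**2))


p = N1 * N2 * (LAM2 - LAM1) ** 2 + N1 * LAM2
alpha_opt = p / (p + N2 * LAM1)

rows = {
    1: table_row(1.0),                # short baseline only
    2: table_row(N1 / (N1 + N2)),     # pooled long baseline
    3: table_row(alpha_opt),          # optimal mixture
}
for model, values in rows.items():
    print(model, " ".join(f"{v:.3f}" for v in values))
```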



6 Conclusions 

We have considered how to make model-based statistical predictions of landfalling hurricane numbers a 
year in advance. The method we use assumes that we can model annual hurricane numbers as Poisson
distributions, with one mean from 1900 to 1994, and another mean from 1995 to 2005. The means 
are unknown and must be estimated from data. We also assume that 2006 will come from the same 
population as the years 1995-2005. 

How best to predict the number of hurricanes for 2006 is not obvious because the data is noisy and 
we have only a few years of data at the recent higher level of activity. We derive expressions for the 
optimal prediction that can be made using a combination of the data from 1900-1994 with the data from 
1995-2005. Finally we apply these expressions to the real data for these periods to generate our optimal 
forecast. 

Our forecast is not intended as a genuine prediction of future hurricane activity, since there are other 
factors that one would want to take into account, such as the widely held belief that there have been 
changes in hurricane activity in the past (with high activity in the 1940s and 1950s for instance). This 
study could be extended to take these complexities into account. However, our study does illustrate the
need to consider carefully (a) what exactly one is trying to predict and (b)
the possibility that biased predictors may outperform unbiased predictors (in terms of RMSE) when we
have little data or the signal-to-noise ratio is small.

There is one statistical issue that we haven't discussed, and that would merit some further investigation. 
The value of $\alpha$ we use in our example is only an estimate of the best value of $\alpha$, and could be rather
different from the best value. This is likely to reduce the benefit of making the optimal combination.
One way to understand this better would be to run simulations, as we did for the corresponding normally
distributed problem in Jewson and Khare (2005).
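
A minimal sketch of what such a simulation might look like is given below. It is not the simulation used in the cited study: the design, the function names, the seed and the number of trials are all assumptions, and the Section 5 sample means are treated as if they were the true rates. In each trial the mixing weight is re-estimated from the simulated sample means, so the resulting MSE includes the effect of estimating $\alpha$.

```python
import math
import random


def poisson_sample(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def plug_in_alpha(mean1, mean2, n1, n2):
    """Mixing weight with sample means substituted for the unknown rates."""
    p = n1 * n2 * (mean2 - mean1) ** 2 + n1 * mean2
    q = n2 * mean1
    return p / (p + q) if p + q > 0 else 1.0   # both means zero: weight is irrelevant


def simulate_mse2(lam1, lam2, n1, n2, trials=2000, seed=1):
    """Monte Carlo MSE (target: lam1) of the short, pooled and plug-in forecasts."""
    rng = random.Random(seed)
    totals = {"short": 0.0, "pooled": 0.0, "plug_in": 0.0}
    for _ in range(trials):
        mean1 = sum(poisson_sample(lam1, rng) for _ in range(n1)) / n1
        mean2 = sum(poisson_sample(lam2, rng) for _ in range(n2)) / n2
        alpha = plug_in_alpha(mean1, mean2, n1, n2)
        forecasts = {
            "short": mean1,
            "pooled": (n1 * mean1 + n2 * mean2) / (n1 + n2),
            "plug_in": alpha * mean1 + (1 - alpha) * mean2,
        }
        for name, value in forecasts.items():
            totals[name] += (value - lam1) ** 2
    return {name: total / trials for name, total in totals.items()}


# Assumed "true" rates: the Section 5 sample means, for illustration only.
results = simulate_mse2(2.181818, 1.642105, 11, 95)
print(results)
```

Comparing the simulated `plug_in` value with the theoretical optimum from equation (27) indicates how much of the benefit of mixing survives the estimation of $\alpha$; the `short` and `pooled` values serve as a check against equations (11) and (17).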

Finally we note that there are other ways that one can approach the same problem. For example, some 
might be tempted to use Bayesian statistics, and others to use bootstrapping methods. The presentation 
we have given, however, does seem to be the simplest of the various options.